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ABSTRACT 

Statistical models that combine cross section and time series data offer analysis and 
interpretation advantages over separate cross section or time series data analyses (Matyas & 
Severstre, 1996). Data that embodies time series and cross section units have not been 
commonplace in the research community until the last 25 years (Tieslau, 1999). In this study, a 
fixed effects panel data model is applied to the National Education Longitudinal Study of 1988 
(NELS:88) data to determine if educational process variables, teacher emphasis, student self- 
concept, and socio-economic status can account for variance in student mathematical 
achievement. A model that includes seven independent variables accounted for 25% of the 
variance in student mathematical achievement test score. The primary objective of this study 
was not to explain mathematics achievement, but provide educational researchers with a 
functional and viable model for panel data analyses. 
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A Fixed Effects Panel Data Model: 

Mathematics Achievement in the U.S. 

Data that embodies time series and cross section units have not been commonplace in the 
research community until the last 25 years (Tieslau, 1999). In fact, the U.S. Department of 
Education’s National Center for Education Statistics (NCES) was not mandated to “collect and 
disseminate statistics and other data related to education in the United States” until the Education 
Amendments of 1974 (Public Law 93-380, Title V, Section 501, amending Part A of the General 
Education Provisions Act). Researchers commonly have termed data that contains time series 
and cross section units to be panel or longitudinal data. In this study, these terms are used 
interchangeable. Essentially, panel data is a set of individuals who are repeatedly sampled at 
different intervals in time, across a multitude of cross sectional variables. The term “individual” 
might be used loosely to imply a person, a household, a school, school districts, firms, or a 
geographical region. Figure 1 provides a typical structure for a panel data set. Schools have 
been used to represent the different “individuals”. (Note: the individual unit could just as well 
have been different schools within a particular district, school districts within a state, or an 
aggregate representation by state). 

Researchers who are interested in understanding, explaining, or predicting variation 
within longitudinal data are faced with some complex stochastic specifications. The problem 
that occurs when realizations exhibit two-dimensional variation — variation across time and cross 
section, is model specification. In other words, researchers need to specify a model that can 
capture individual differences in behavior across individuals and/or through time for estimation 
and inference purposes (Greene, 1 997). 
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Figure 1 . Panel Data Structure 

In general, longitudinal (panel) data sets contain a large number of cross-section units and a 
relatively small number of time- series units. 

The U.S. Department of Education began collecting data in 1988 about critical transitions 
experienced by students as they leave elementary schools and progress through high school and 
into postsecondary institutions or the work force. The National Education Longitudinal Study of 
1988 (NELS:88) contains data about educational processes and outcomes pertaining to student 
learning, predictors of dropping out, and school effects on students’ access to programs and 
equal opportunities to learn. The first follow-up was conducted with the same students, their 
teachers, and principals in 1990. The second follow up survey was conducted in 1992, and the 
third in 1994. Data from NELS:88 will be used in this study to determine if student perception 
of educational process variables can account for the variance in mathematical achievement. 



Model Specification 

When should a fixed effects or random effects model be utilized? The answer to this 
question is often debated. Some believe that it is dependent upon the underlying cause in the 
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model. For example, if the individual effects are the result of a large number of non-observable 
stochastic variables, then the random effect interpretation is demanded. Others think the 
decision rest on the nature of the sample - that is when the sample is comprehensive or 
exhaustive, then fixed effects models is the natural choice to enhance the generalizability. On 
the contrary, if the sample does not contain a large percent of the population then the random 
effect model would be the model of choice. According to Hasiao (1985), it is ultimately, “ up to 
the investigator to decide whether he wants to make an inference with respect to population 
characteristics or only with respect to effects that are in the sample” (pg. 131). It is unlikely that 
this debate will ever be resolved per se, however, if the choice between the two underlying 
causes is clear, then the estimation method should be chosen accordingly. However, if the 
choice is not clear, then the decision should be based on the nature of the sample and statistical 
evidence. For example, if the t- stats on the individual effects are mostly significant then this is 
clearly a sign that a significant component of the model is accounted for by the individual effect 
parameter and so FE might be preferred to RE. However, if the Hausman test statistic, a statistic 
designed to test model fit, is large then one might have reason to believe that a RE model is 
preferred. 

Fixed Effects Models 



The fixed effects (FE) model takes co to be a individual specific constant term in the 
regression equation 

Yu = a, + j8 iX 1 „ + j8 zX 2,-, + ... + PkXK u + £u ( 1 ) 

or in matrix notation 

Yi = at + Xu p + £u (2) 
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where 

A. 

The indexes cross-section realizations so that i = 1 ,2,3, ■■■, N and “t” indexes time- 
series realizations so that / = 1, 2,3, ... ,7’. The individual effect a*, is regarding to be constant 

over time (f) and specific to the individual cross-sectional unit (i). The termct,. is presumed to 
captures the unobservable, and non- measurable characteristics that differentiate individual units. 
Basically, this implies that all behavioral differences between individuals (e.g., schools in Figure 
1) are fixed over time and are represented as parametric shifts of the regression function. 

Matayas and Sevestre (1996, p. 34) state, “the intercept is allowed to vary from individual to 
individual while the slope parameters are assumed to be constants in both the individual and time 
dimensions”. The fundamental assumptions of the fixed effects model are: 

(1) E[e„]=0, 

(2) cov(e,. ( ,£;,)= 0, 

(3) var (e,., ) = e[c*] = a] , 

(4) E[e,,^l,]=E[e,,^2,]=E[e,,2r3,]= ... = E[e,,^,]= 0, and 

(5) X H is not invariant. 



Xn'= [Xh, X2 „...XKn] and f = 
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Under these assumptions, the original least squared estimator (OLS) can be use to obtain 
unbiased, consistent, and efficient (BLUE) parameter estimates. 

Random Effects Model 

The random effects (RE) model — also know as the error component model, includes a 
non- measurable stochastic variable, which differentiates individuals. It is written as: 

Y it =(X + j3 1 X 1 it + j3 2 X 2 it + ... + K XK it “P U i + £ it , 

( 3 ) 

or in matrix notation: 

Yi = CC + Xu /3 + u i + Sit , (4) 

p* 

fa. 

The “j” indexes cross-section realizations so that /=1 ,2,3,. N and “t” indexes time-series 
realizations so that t = 1,2,3,. . ., T. The term “m” is a stochastic variable that embodies the 
unobservable or non-measurable disturbances that accounts for individual differences. 
Essentially, the effect is thought to be a random individual effect rather that frxed parameter. 

For example, a researcher might try to discern whether there is difference in achievement 
between districts in the State of Texas. Instead of including every school district in the equation 
(as we would have in the fixed effects model using dummy variables) one can randomly sample 
school districts and assumes that the effect is random distributed across “individuals” but is 
constant through time. 



where: Xn'=\x \u X2u ... XKtt\ and /3 = 
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The fundamental assumptions of the random effects model are as such: 



E[u„Xl,] =E[m„*2,]=E[m„*3,]= ... =E[u i ,XK i ]=0, 

e[ £ J=e[u,.]=o, 

var (ej = e[ £ 2 ] = a], 
var («,.) =e[m, 2 ] =cr u 2 , 
cov(m,.,£,,)=e[m,.,£,,]=ct £ u . 

Assuming normality u, ~ N (0, c 2 ),£„ ~ N (0,cr 2 ), both and “£,.,” are stochastic 
variables, but form one composite error term-called omega (u j + e it ) = (O u . 



Where co„ 



U i + £j 

U, + £,. 



U i + £ i 



1/ ^ 4" £ n 



1/ 2 “I" £ 22 



u 2 + £27- 



w w + £ * 



U N £ jV2 



Ww + £ * 




9 



Fixed Effects Panel Data Model 9 



The error term now consist of two components: (1) the error disturbance £„ , and (2) the 
individual specific disturbance u j . The RE model now takes the form of 



Y it = a + (5 i X h, ♦ p 2 X 2,v ♦ ... + PkXK « + 
or in matrix notation Yi = (X + Xn' P + CQn . 

The error term in the model now exhibits the following characteristics: 

( ^ k 2 a e.u 

var(fi)J= . 

°u,e<yl 



(5) 

( 6 ) 



As it stands, the original least squares estimator may not be applied to equation 6 because the 
error term not longer possess ideal properties (constant variance and zero covariance) thus the 
estimate would be inefficient and, hence generalized least squares (GLS) is appropriate. 

However, the nature of data in behavioral sciences does not permit the variance components 
<T u 2 and a] to be known, therefore, alternative estimation methods must be utilized. One 
common estimation method that can deal with the unknown variance components feasible 
generalized least squares (FGLS). Essential, FGLS takes an estimate of the variance components 
and then estimates the equation. 

The individual effect in the random effect model may, too, be tested with the following 
H 0 :u i = 0 or, equivalent ly, <7 2 = 0 

hypotheses: 

Ha-.o] * 0. 
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After correcting the error term (a ) the t and F-test are reliable, thus inference can be regarded 
as valid. Base on statistical evidence, a FE model will be use estimated and will discussed later 
in the this paper. 

National Education Longitudinal Data Set: 88 

The NELS:88 database is divided into two sections: (1) N2P, and (2) N4P. In this study, 
data were extracted from N2P. A representative sample of students (N= 16,749) enrolled in tenth 
grade in the spring of 1990 who completed a questionnaire in both the first follow-up and second 
follow-up were identified and used in the analysis. 

Seven independent variables are included in the specified model. They are listed as 
entered into the model: (1) Review Work (F2S19BA), (2) ListenLecture (F2S19BB), 

(3) CopyNotes (F2S19BC), (4)Calculators (F2S19BF), (5) Think Problem (F2S20D), 

(6) SES (F2SESl),and (7) Self concept (F2CNCPT). The first four variables, (ReviewWork, 
ListenLecture, Copynotes, Calculators), are frequency measures of student educational processes 
and are scaled as followed: (1) Never/Rarely, (2) 1-2 Times/Month, (3) 1-2 Times/Week, (4) 
Almost each day, (5) Every Day. For example, the variable ReviewWork is a measure of how 
frequently students review their work for the previous day. The variable ListenLecture is a 
measure of how frequently students listen to the teacher’s lecture. The variable CopyNotes is a 
measure of how frequently students take notes. The variable Calculator is a measure of how 
frequently students use calculators. The variable ThinkProblem measures student perception of 
teachers emphases on mathematical objectives and is scaled, (0) none, (1) minor emphasis, (2) 
moderate emphasis, (3) major emphasis. SES is a continuous variable indicating socioeconomic 
status. This measure is based on Duncan’s 1961 socioeconomic index for all occupations. It was 
derived from the parent questionnaire data, the student questionnaire data, or the first follow-up 
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or second follow-up New Student Supplement data. This variable has been standardized to have 
a mean of 0 and standard deviation of 1 . 

Selfconcept is a composite measure of all of the self-concept items (question 66) in the 
student questionnaire. Essentially, this variable measures students’ self concept on a four point 
scale with: (1) strongly agree ,(2) agree ,(3) disagree, (4) strongly disagree. It should be noted 
that this variable was reversed scaled before composite score was created and was standardized 
to have mean of zero and a standard deviation of 1. MathScore is the dependent variable and it 
was derived by Item response theory (IRT) to have a mean of 50 and standard deviation of 10. 

Empirical Results 

The FE model below was specified and estimated. 

(Mathscore f/ ) = /3, +/) 2 (Re viewWork) h + (ListenLecture) n + f3 4 (Copy Notes) it + f 5 (Calculators) u 

+ P 6 (Think 'Problem) jt + (SES) n + /3 g (Selfconcept) u + e it 

Table 1 provides evidence that the FE model is indeed the correct choice over the RE 
model; note all t-stats are significant. Region is the cross section unit (i = 1, 2,3,4) indicating 
which of the four US Census regions (1) Northeast, (2) Midwest,(3) South, or (4) West. 

Table 1 

Estimated Fixed Effects 



Group 


Coefficient 


Standard Error 


t- ratio 


North 








East 


45.84312* 


0.42968 


106.69106 


North 








Central 


45.48305* 


0.44005 


103.35884 


South 


43.06207* 


0.42588 


101.11422 


West 


44.10279* 


0.44472 


99.16970 
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♦Statistically significant at the 99% level. 

Table 2 provides the descriptive statistics, measure of central tendency, measure of 
dispersion, minimum and maximum, and number of realizations. Table 3 provides the 
correlation coefficients for all of the variables used in the analysis. 

Table2 

Descriptive Statistics 



Variable 


Mean 


Std.Dev. 


Minimum 


Maximum 


Cases 


ReviewWork 


3.87 


1.21 


1.00 


5.00 


13577 


ListenLecture 


4.24 


1.03 


1.00 


5.00 


13565 


CopyNotes 


4.01 


1.27 


1.00 


5.00 


13567 


Calculators 


3.69 


1.38 


1.00 


5.00 


13560 


ThinkProblem 


2.23 


0.84 


0.00 


3.00 


13568 


SES 


0.04 


0.81 


-3.24 


2.75 


16563 


Selfconcept 


0.01 


0.70 


-3.69 


1.24 


15123 


MathScore 


51.81 


9.93 


29.50 


71.49 


12992 


Region 


2.56 


1.01 


1.00 


4.00 


16426 
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Table 3 



Correlation Matrix 





Math 

Score 


Review 

Work 


Listen 

Lecture 


Copy 

Notes 


Calculators 


Think 

Problem 


SES 


Self 

Concept 


Region 


Math 

Score 


1.000 


















Review 

Work 


.099 


1.000 
















Listen 

Lecture 


.119 


.369 


1.000 














Copy 

Notes 


.103 


.278 


.535* 


1.000 












Calcu- 

lators 


.148 


.135 


.140 


.095 


1.000 










Think 

Problem 


.181 


.224 


.256 


.237 


.099 


1.000 








SES 


.470 


.042 


.044 


.084 


.103 


.050 


1.000 






Self 

Concept 


.144 


.085 


.085 


.076 


.053 


.134 


.081 


1.000 




Region 


-.103 


.009 


.022 


-.031 


.074 


-.009 


-.082 


.022 


1.000 



Note the moderate correlation between the two exogenous variables, CopyNotes and ListenLecture (p = .540) 
is indicative of multicolinearity. This correlation gives reason to question the inference drawn from the t- stats 
on these two variables. However, the parameter estimates for CopyNotes and ListenLecture are still BLUE. 
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Six out of the seven independent variables were statistically significant at the 0.000 alpha level 
accounting for 25% of the variance in the dependent variable (mathematics achievement score). 
See Table 4 for parameter estimates. 

Table4 

Fixed- Effects Estimates 



Variable 


Coefficient 


Standard Error 


|b/St.Er 


P[|Z|>z] 


ReviewWork 


0.2342* 


0.0716 


3.272 


0.0011 


ListenLecture 


0.3704* 


0.0944 


3.924 


0.0001 


CopyNotes 


-0.1090 


0.0732 


-1.488 


0.1367 


Calculators 


0.6669* 


0.6007 


11.100 


0.0000 


ThinkProblem 


1.3877* 


0.0970 


14.304 


0.0000 


SES 


4.9379* 


0.1034 


47.733 


0.0000 


SelfConcept 


1.1370* 


0.1129 


10.071 


0.0000 


R- Square 


.246 








Adjust R- Square 


.245 









* Statistically significant at 99% level. 



The variable ReviewWork is statistically significant (t= 3.27, p > .001). As ReviewWork 
increases by one unit, MathScore increases by 0.234 points, c.p.. In other words, as students 
increase the frequency in which they review their work, holding all else constant, their math 
score increases by 0.234 points. 

The variable ListenLecture is statistically significant (t= 3.924, p > .000). As 
ListenLecture increases by one unit, MathScore increases by 0.370 points. Or put differently, the 
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more attentive the student is to the teacher’s lecture, their math score increases by 0.370 points, 
c.p.. The variable CopyNotes is not statistically significant (t= - 1 .488, p > . 1 367). 

The variable Calculators is statistically significant (t=l 1.10, p>.000). As Calculators 
increases by one unit, MathScore increase by 0.667 points, c.p.. Essentially, this estimate is 
showing that students math score will increase with the use of a calculator. 

The variable ThinkProblem is statistically significantly (t= 14.304, p >.000). Recall, this 
variable measures student perception of teachers emphases on mathematical objectives and is 
scaled, (0) none, (1) minor emphasis, (2) moderate emphasis, (3) major emphasis. As 
ThinkProblm increases by 1 unit, MathScofe increases by 1 .387 point. Abstracted differently, 
the more teachers emphasis “thinking about what a problem means and ways it might be solve”, 
holding all else constant, students math score increase by 1.387 points. 

The variable SES is statistically significant (t = 47.333, p > 0.000). As SES increases by 
1 unit, holding all else constant, MathScore increases by 4.937 points. Recall SES is a 
continuous variable indicating sample member’s socioeconomic status. This measure is based 
on Duncan’s 1961 socioeconomic Index for all Occupations. 

The variable SelfConcept is statistically significant (t=10.071, p>0.000). As SelfConcept 
increases by 1 unit, MathScore increases by 1. 137 points, c.p.. Essentially, students who have a 
more positive self perception, are scoring higher on the standardized math test. 

Conclusions 

In this study, a fixed effects panel data model were applied to the National Education 
Longitudinal Study of 1988 (NELS:88). The empirical evidence presented here suggests that 
student mathematics test score is influence by educational process variables, teacher emphasis, 
student self-concept, and socio-economic status. Specifically, a model that includes seven 




BEST COPY AVAILABLE 

16 



Fixed Effects Panel Data Model 16 



independent variables accounted for 25% of the variance in student mathematical achievement 
test score. The primary objective of this study was not to explain mathematics achievement, but 
provide educational researchers with a functional and viable model for panel data analyses. 

Recommendation 

The NELS:88 data set does not have a means of extracting the time component of the 
data. Although, the models for analyzing time effects were not discussed in the study, it is an 
important aspect of panel data that should be code when the data file is constructed. In addition, 
the time series unit should be measured in smaller periods of time. Further analysis should be 
peformed on the error term (e„ ) . That is, the error term should be analyzed for hetroskedascity 
and autocorrelation and problems. 
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Appendix 



LimDep Code ' 

READ;File=C: \WINDOWS \ Program Files\ES Limdep \ PROGRAM \nels6 . lpj ; 

Nobs = 16 74 9 ; 

Nvar=23 ; 

Names=xl , x2,x3,x4,x5,x6,x7,x8,x9,xl0,xll,xl2,xl3,xl4,xl5,xl6,xl7,xl8, 
xl9 , x2 0 , x2 1 , x22 , x2 3 $ 



SKIP$ 

DSTATS ; RHS = X2 , X3 , X4 , X7 , XI 3 , X19 , X20 , X23 ; OUTPUT = 2 $ 



REGRESS 



Lhs=X22 

Rhs=X2 , X3 , X4 ,X7 ,X13 , X19 , X20 
Str=X23 
Wt S=X1 6 
Panel $ 



REGRESS 



Lhs=X22 

Rhs=X2 , X3 , X4 , X7 , XI 3 , XI 9 , X2 0 



Str=X23 
Panel 
Output=2 
Wt s=X16 
Fixed $ 



Data Output 

SKIP$ 

DSTATS; RHS = X2 , X3 , X4 , X7 , X13 , X19 , X20 , X23 ; OUTPUT = 2 $ 

De scriptive Statistics 
All results based on nonmissing observations. 



Variable 


Mean 


Std.Dev. 


Minimum 


Maximum 


Cases 


X2 


3 .87 029535 


1.21401018 


1 . 00000000 


5 . 00000000 


13577 


X3 


4 .24061924 


1 . 02966250 


1 . 00000000 


5 . 00000000 


13565 


X4 


4 . 00906612 


1 .27206705 


1 . 00000000 


5 . 00000000 


13567 


X7 


3 . 69041298 


1 . 37 809060 


1 . 00000000 


5 . 00000000 


13560 


X13 


2.23349057 


. 839799327 


.000000000 


3 .00000000 


13568 


XI 9 


.48564873 5E -01 


. 811172698 


-3.24000000 


2 . 75000000 


16563 


X20 


. 1 11 9553 0 0E -01 


. 701701693 


-3 .69000000 


1.24000000 


15123 


X23 


2 . 55777426 


1 . 01482683 


1.00000000 


4 . 00000000 


16426 



Correlation Matrix for Listed Variables 
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x2 

X2 1.00000 
X3 .38402 
X4 .28129 
X7 .13514 
X13 .22252 

XI 9 .04671 

X20 .08881 

X23 .00758 

--> REGRESS ; Lhs=X; 

; Rhs=X2 , X3 
; St r=X23 
;WtS=X16 
/Panel $ 



+ + 

| OLS Without Group Dummy Variables | 

| Ordinary least squares regression Weighting variable = X16 | 

j Dep. var. = X22 Mean= 52.55493863 , S.D.= 9.343603189 | 

j Model size: Observations = 10895; Parameters = 8, Deg.Fr.= 10887 j 

j Residuals: Sum of squares= 731652.8662 , Std.Dev.= 8.19782 j 

j Fit: R-squared= .230712, Adjusted R -squared = .23022 | 

j Model test: F[ 7, 10887] = 46 6.44, Prob value = .00000 | 

| Diagnostic: Log -L = -38376.9800, Restricted (b=0 ) Log -L = -39805.8044 | 

j LogAmemiyaPrCrt . = 4.208, Akaike Info. Crt . = 7.046 | 



Panel Data Analysis of X22 [ONE way] 

Unconditional ANOVA (No regressors) 

Source Variation Deg. Free. Mean Square 

Between 20751.0 3. 6917.00 



Residual 

Total 


930327. 
951078 . 


10891 . 
10894 . 


85.4216 

87.3029 


1 

1 




+ 


- + 


+ 


+ 


--+ + 


Variable 


| Coefficient 


| Standard Error 


| b/St . Er . | 


P [ | Z | >z] 


| Mean of x| 




+ 


- + ' 


+ 


+ 


--+ + 


X2 


.2150986215 


. 7 2295926E -01 


2.975 


.0029 


3 .8895953 


X3 


.3746630995 


. 95235646E - 01 


3 .934 


. 0001 


4 .2207458 


X4 


- . 1516385665 


. 73 563678E -01 


-2.061 


. 0393 


3 .9716586 


XI 


. 7025006784 


. 59425942E -01 


11 . 821 


. 0000 


3 .7239903 


XI 3 


1 .398697309 


. 97973819E -01 


14 .276 


. 0000 


2 .2173183 


XI 9 


5 . 040076855 


.10413200 


48 .401 


. 0000 


.867 01895E -01 


X20 


1 . 035652953 


.11377009 


9.103 


.0000 


. 42022957E -01 


Constant 


44 . 54122268 


.41506161 


107.312 


.0000 
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X3 


X4 


XI 


X13 


X19 


X20 


X2 3 


.38402 


.28129 


.13514 


.22252 


. 04671 


■. 08881 


. 00758 


00000 


. 54066 


. 13636 


.26014 


. 04523 


. 08992 


. 01662 


54066 


1 . 00000 


. 09404 


.23307 


. 07737 


. 07659 


- . 03424 


13636 


. 09404 


1 . 00000 


. 10102 


.10769 


.05500 


. 07762 


26014 


. 23307 


. 10102 


1.00000 


. 05100 


. 14062 


- . 01871 


04523 


. 07737 


. 10769 


. 05100 


1.00000 


. 08907 


- . 08566 


08992 


. 07659 


.055 00 


. 14062 


. 08907 


1 . 00000 


. 01745 


01662 


- . 03424 


. 07762 


- . 01871 


- . 08566 


. 01745 


1 . 00000 



,X4,X7,X13,X19,X20 



Least Squares with Group Dummy Variables | 

Ordinary least squares regression Weighting variable = X16 | 

Dep. var. = X22 Mean= 52.55493863 , S.D.= 9.343603189 j 

Model size: Observations - 10895, Parameters = 11, Deg.Fr.= 10884 | 

Sum of squares= 716924.2581 , Std.Dev.= 8.11601 | 

. 24551 
. 00000 
-39805 . 8044 
7.027 I 



Residuals : 

Fit: R-squared= .246198, Adjusted R -squared = 

Model test: F[ 10, 10884] = 355.48, Prob value = 

Diagnostic: Log-L = -38266.1998, Restricted (b=0 ) Log -L 
LogAmemiyaPrCrt . = 4.189, Akaike Info 

Estd. Autocorrelation of e(i,t) -.000540 



Crt.: 



erJc 



20 



Fixed Effects Panel Data Model 20 















- - + + 


Variable 


| Coefficient 


| Standard Error 


| b/St . Er . 


| P [ | Z | >z] 


| Mean of X | 
















X2 


.2342486655 


. 71596894E 


-01 


3 .272 


. 0011 


3 .8895953 


X3 


.3704126069 


. 94402933E 


-01 


3.924 


. 0001 


4 .2207458 


X4 


- . 1089542352 


. 73215840E 


-01 


-1.488 


. 1367 


3 . 9716586 


X7 


.6668605394 


. 6007633 2E 


-01 


11 . 100 


. 0000 


3 . 7239903 


X13 


1 . 387686017 


. 97 0114 2 8E 


-01 


14 .304 


. 0000 


2 . 2173183 


XI 9 


4 . 937094032 


. 10343209 




47.733 


. 0000 


. 8 6 70 1 895E -01 


X20 


1 . 136968803 


. 11289363 




10 . 071 


. 0000 


. 42022957E -01 



+ + 

I Test Statistics for the Classical Model | 







Model Log -Likelihood 


Sum of 


Squares 


R -squared 


(i) 


Constant term only 


-39805 . 80420 


. 9510780165D+06 


. 0000000 


(2) 


Group 


effects only 


-39685 . 63264 


. 9303270109D+06 


. 0218184 


(3) 


X 


- variables only 


-38376 . 97986 


. 7316528662D+06 


. 2307120 


(4) 


X 


and 


group effects 


-38266 . 19962 


. 7 1 6924 2 58 1D+ 06 


. 2461983 










Hypothesi 


s Tests 












Likelihood 


Ratio Test 




F Tests 








Chi -squared d 


. f. Prob. 


F num. denom. 


Prob value 


(2) 


vs 


(i) 


240 . 343 


3 • .00000 


80.975 


3 10891 


. 00000 


(3) 


vs 


(i) 


2857 . 649 


7 .00000 


466.435 


7 10887 


. 00000 


(4) 


vs 


(i) 


3079.209 


10 .00000 


355.481 


10 10884 


. 00000 


(4) 


vs 


(2) 


2838 . 866 


7 .00000 


462 .825 


7 10884 


. 00000 


(4) 


vs 


(3) 


221 . 560 


3 .00000 


74 . 534 


3 10884 


.00000 



+ + 

REGR; PANEL. Could not invert VC matrix for Hausman test. 



Random Effects Model: v( i,t) = e(i,t) + u(i) | 

Estimates: Var [e] = .658696D+02 | 

Var [u] = .489455D+01 | 

Corr [v (i , t ) , v (i , s) ] = .069167 | 

Lagrange Multiplier Test vs. Model (3) = 6971.10 | 
{ 1 df ; prob value = .000000) | 

(High values of LM favor FEM/REM over CR model.) | 
Fixed vs. Random Effects (Hausman) = .23 | 

{ 7 df , prob val ue = 1.000000) | 

(High (low) values of H favor FEM (REM) .) | 

Reestimated using GLS coefficients: | 

Estimates: Var [e] = .663837D+02 | 

Var [u] = . 4 994 45D+01 | 

Sum of Squares . 729365D+06 | 

R -squared .233117D+00 | 



+ 

| Variable 


| Coefficient 


| Standard Error 


| b/St . Er . 


|p[ 


| Z I >z] 


| Mean of 


+ 

X2 


. 2341713422 


. 71596797E -01 


3 .271 




, 00 11 


3 . 8895953 


X3 


.3703833036 


. 94402278E -01 


3.923 




, 0001 


4 . 2207458 


X4 


- . 1090676626 


. 73 213 8 8 9E -01 


-1.490 




. 1363 


3 . 9716586 


X7 


. 6669200976 


. 60069733E -01 


11 . 102 




, 0000 


3 .7239903 


X13 


1 .387746607 


.970 1 13 4 IE -01 


14 .305 




. 0000 


2 .2173183 
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X19 


4 . 937585087 


. 10343032 


47 . 738 


. 0000 


. 86701895E 


-01 


X20 


1 . 136524709 


. 11289247 


10 . 067 


. 0000 


. 42022957E 


-01 


Constant 


44 . 62217113 


1 .1804686 


37 .800 


. 0000 







--> REGRESS ; Lhs=X22 

; Rhs=X2 , X3 , X4 , X7 , XI 3 , X19 , X2 0 
; St r=X23 
; Panel 

; Output =2 
; Wt s=X16 
; Fixed $ 



OLS Without Group Dummy Variables 
Ordinary 
Dep. var. = 

Model size: 

Residuals : 

Fit : 

Model test: 

Diagnostic : 



least squares regression Weighting variable = X16 
X22 Mean= 52.55493863 , S.D.= 9.343603189 

Observations = 10895, Parameters = 8, Deg.Fr.= 10887 



Sum of squares = 731652.8662 , Std.Dev.= 

R-squared= .230712, Adjusted R -squared = 
F [ 7, 10887] = 466.44, Prob value = 

Log -L = -38376.9800, Rest r icted (b=0 ) Log - 



LogAmemiyaPrCrt . = 4.208, Akaike Info. Crt . = 

Panel Data Analysis of X22 [ONE way] 

Unconditional ANOVA (No regressors) 



8 . 19782 
.23022 
.00000 | 
-39805 . 8044 
7.046 I 



Source 


Variatio n 


Deg. Free. 


Mean Square 




1 


Between 


20751 . 0 


3 . 


6917 . 00 




1 


Residual 


930327 . 


10891 . 


85 . 4216 




1 


Total 


951078. 


10894 . 


87.302 9 




1 












+ 




+ + 4 




• + + 


+ 


+ 


Variable 


| Coefficient | 


Standard Error 


| b/St . Er . | P [ | Z | > z ] 


| Mean of X 


1 




+ + 4 




. + + — 


+ 


+ 


X2 


.2150986215 


. 72295926E -01 


2.975 .0029 


3 . 8895953 




X3 


. 3746630995 


. 95235646E -01 


3.934 .0001 


4 . 2207458 




X4 


- . 1516385665 


. 73563678E -01 


-2.061 .0393 


3 . 9716586 




X7 


. 7025006784 


. 5 942 5 942E -01 


11.821 .0000 


3 . 7239903 




X13 


1 . 398697309 


. 97973819E -01 


14.276 .0000 


2.2173183 




X19 


5 . 040076855 


.10413200 


48.401 .0000 


. 86701895E - 


01 


X2 0 


1 . 035652953 


.1 1377009 


9.103 .0000 


. 42022957E - 


01 


Constant 


44 . 54122268 


.41506161 


107.312 .0000 






Least Squares with Group 


Dummy Variables 






+ 

1 


Ordinary 


least squares 


regression 


Weighting variable 


= X16 


1 


Dep. var 


. = X22 Mean 


= 52.55493863 


, S . D . = 9.343603189 


1 



Model size: Observations = 10895, Parameters = 11, Deg . Fr . = 10884 | 

Residuals: Sum of squares= 716924.2 581 , Std.Dev.= 8.11601 

Fit: R-squared= .246198, Adjusted R -squared = .24551 

Model test: F[ 10, 10884] = 355.48, Prob value = .00000 | 

Diagnostic: Log-L = -38266.1998, Restricted (b=0 ) Log -L = -39805.8044 

LogAmemiyaPrCrt . = 4.189, Akaike Info. Crt . = 7.027 | 

Estd. Autocorrelation of e(i,t) -.000540 



+ + 

|Variable | Coefficient 
+ + 



Standard Error 



| b/St . Er . 
+ 



|P[|Z| 
- + 



>z] 



Mean of X| 



o 

ERIC 
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X2 


.2342486655 


. 71596894E 


-01 


3 .272 


.0011 


3 .8895 953 




X3 


.3704126069 


. 94 4 0293 3E 


-01 


3.924 


. 0001 


4 . 2207458 




X4 


- .1089542352 


. 73215840E 


-01 


-1.488 


. 1367 


3 . 9716586 




X7 


.6668605394 


. 60076332E 


-01 


11 . 100 


.0000 


3 . 7239903 




X13 


1 .387686017 


. 97011428E 


-01 


14 .304 


. 0000 


2 .2173183 




X19 


4 . 937094032 


.10343209 




47.733 


.0000 


8 6 701895E 


-01 


X20 


1 . 136968803 


.11289363 




10.071 


.0000 


4 2 022957E 


-01 




Estimated Fixed Effects 














Group Coefficient Standard E rror 


t 


: - ratio 






1 


45 . 84312 




.42968 


106 . 


69106 






2 


45 .48305 




.44005 


103 . 


35884 






3 


43 . 06207 




.42588 


101 . 


11422 






4 


44 . 10279 




.44472 


99 


. 16970 











Test Statistics for the 


Classical 


Model 


1 

| 






Model Log -Likelihood 


Sum of 


Squares 


1 

R -squared | 


(1) 


Constant term only 


-39805 .80420 


. 9510 7 8 0 16 5D+ 06 


.0000000 | 


(2) 


Group 


effects only 


-39685.63264 


. 93 03 2 7 0 10 9D+0 6 


.0218184 | 


(3) 


X 


- variables only 


-38376 . 97986 


. 73 16 52 86 62D+06 


.2307120 | 


(4) 


X 


and 


group effects 


-38266 .19962 


. 7169242581D+06 


.2461983 | 

1 










Hypothesis Tests 




1 

1 








Likelihood 


Ratio Test 




F Tests 


1 






Chi -squared d 


. f. Prob. 


F num. denom. 


Prob value | 


(2) 


vs 


(1) 


240.343 


3 .00000 


80.975 


3 10891 


.00000 | 


(3) 


vs 


(1) 


2857 . 649 


7 .00000 


466 .435 


7 10887 


.00000 | 


(4) 


vs 


(1) 


3079 .209 


10 .00000 


355.481 


10 10884 


.00000 | 


(4) 


vs 


(2) 


2838 . 866 


7 .00000 


462.825 


7 10884 


.00000 j 


(4) 


vs 


(3) 


221 . 560 


3 .00000 


74.53 4 


3 10884 


.00000 | 
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